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It is not so well-known that measurement-free quantum error correction protocols can be designed to achieve 
fault-tolerant quantum computing. Despite the potential advantages of using such protocols in terms of the 
relaxation of accuracy, speed and addressing requirements on the measurement process, they have usually been 
overlooked because they are expected to yield a very bad threshold as compared to error correction protocols 
which use measurements. Here we show that this is not the case. We design fault-tolerant circuits for the 9 qubit 
Bacon-Shor code and find a threshold for gates and preparation of P( p g ) t hresh = 3-76 x 10~ 5 (30% of the best 
known result for the same code using measurement based error correction) while admitting up to 1/3 error rates 
for measurements and allocating no constraints on measurement speed. We further show that demanding gate 
error rates sufficiently below the threshold one can improve the preparation threshold to P( p ) t hresh ~ 1/3- We 
also show how these techniques can be adapted to other Calderbank-Shor-Steane codes. 

PACS numbers: 03.67.-a, 03.67.Lx 

An ideal quantum computer is a theoretical object capable of highly efficient computation. A major difficulty with the realiza- 
tion of such a powerful theoretical object is that physical implementations of any quantum operation will be noisy. However, with 
the use of quantum error correction (QEC) codes, fault-tolerantly designed circuits, and provided that error rates are below some 
threshold value, one is still able to efficiently simulate a quantum computation with arbitrarily high accuracy [ -3]. Experimental 
state of the art results show that error rates and execution times required for operations in order to achieve the fault-tolerant 
regime are not currently available. The results in this paper will alleviate part of this constraint pushing required error rates a step 
closer to current technology. 

In many physical systems measurements pose a potential bottleneck for scalable fault-tolerant quantum computation because 
they are slower and/or noisier than gates or preparation [4, 5]. However, they are central in the readout stage, and are widely used 
in QEC routines as a way of extracting error syndrome information in order to correct the quantum data. Slow measurements 
have been shown to be a surmountable issue by using error correction where measured error syndromes can be classically post- 
processed at the end of a round of gates to execute a compensating Pauli frame rotation [ ], with the caveat that there can be 
a significant time lag during classical processing [ ]. Regarding noise, measurement error rates cannot usually be improved 
by noise suppression techniques, i.e. dynamical decoupling, whereas gates can be [9, 10]. Furthermore, measurement results 
must be distinguishable in every time step, i.e. one must be able to discriminate between results from different measurements 
repeatedly over the computation, which leads to further constraints on the physical processes executing the measurements, e.g. 
measurements relying on photon scattering as in ion traps. [11] 

In this letter we overcome these problems by eliminating most measurements during fault-tolerant computation. It is well- 
known [1, 12] that this is possible for Calderbank-Shor-Steane codes [ ] such as the Steane code, however " The penalty paid 
in the stringency of the threshold has never been quantified, but it is expected that replacing measurement by coherent operations 
decreases the noise threshold by a large amount"^}. We show that contrary to these conjectures coherent FT QEC suffers only 
slightly in regards to the threshold and brings substantial rewards. 

We begin by setting up our scenario and introducing measurement free error correction (EC) routines for the Bacon-Shor code. 
We then show how to execute fault-tolerant Clifford operations consisting of: (I) preparation of |0) and |+) = (|0) + \ l))/\/2 
states, (II) Clifford group [ ] unitary gates, and (III) measurement in the X and Z basis, and derive a threshold error rate which is 
stringent for preparation and gates but as high as 33% for measurement. We proceed to show that through an encoder circuit we 
can prepare special ancillas at any level of concatenation. While this encoding cannot achieve an arbitrary low error rate (p a m ), 
it is small enough, p[ anc ) < pn-unc = sin 2 (7i/8) ~ 14.6%, to be used as a resource in magic state distillation [ , ](MSD), a 
protocol using exclusively Clifford operations to distill arbitrarily low-error encoded \H L ) = (|0 L ) + e in l A \ \ L ) ) jsfl nuigic states. 
Using this resource to execute non-Clifford gates at the highest level of concatenation completes the universality of our model. 
Moreover, we show how to relax the threshold value for preparation, using a variant of algorithmic cooling and demanding a gate 
error rate, pi g ), sufficiently below the threshold. Thus fault tolerant universal quantum computing (FTUQC) can be achieved 
with measurement and preparation error rates, pr p \ and pr m \ respectively, which are already within reach of current technology. 

We demonstrate our scheme for the 9-qubit Bacon-Shor (BS) subsystem code [ ] but our tools can be adapted to other CSS 
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codes (see B). The BS code is defined by the stabilizer set on a two dimensional array, 

Z Z Z / / / ^| 

Z Z Z ,Z Z z\. (1) 
/ / / Z Z Z) 

For this code logical Pauli operators are given by = Il/=i-^i,i>^L = Yl^=i^i,i modulo stabilizer operations, i.e. Xi (Z{) acts 
on a column (row) of the array. This code is a subsystem code and is invariant under pairs of X(Z) operators along any given 
row(column) because they act only on gauge degrees of freedom. Given the subsystem structure of the code one is able to 
correct acting on only one row (for X-errors) and one column (for Z-errors). The library of physical (level-0) gates we use is 
{X,Z,H,CNOT,TOFFOLI,Z -TOFFOLI = H® 3 (TOFFOLI)H® 3 ,\0) preparation, | +) preparation,!//) 
preparation, Z-measurement,X-measurement}, allowing also for non-local interactions. We adopt an adversarial, local, stochastic 
error model [16]. 

The first obstacle is of course to design a EC routine/gadget which uses coherent feedback instead of measurements & feedback. 
One needs to use more gates within the EC gadgets to execute the coherent feedback and, in particular, one would typically need 
fault-tolerant implementations of TOFFOLI gates at every level. This would yield a very bad threshold value [1, 6]. However, 
during QEC we do not really need a full-fledged TOFFOLI gate since it will only be controlled by ancillas containing the 
syndrome, i.e. classical, information. For instance when correcting X-errors, a Z error in the ancillas is irrelevant, thus we can 
map a BS encoded ancilla to a quantum repetition (QR), i.e. bit-flip, code which protects against X errors but that is vulnerable to 
Z-errors. Using the QR encoded controls, and the structure of the logical operators in the BS code, we can use bitwise TOFFOLI 
gates to implement the needed operation (see Fig. (1(a))). 

The mapping between the BS code and the QR code, of the same level of concatenation, is achieved using the gate 

W ) denotes a bit 



$C{k) : ) — > (s) W ), where ) is encoded in the ^-concatenated BS code ( 9 k physical qubits), and 
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encoded in ^-concatenated QR code (3* physical qubits). From our joint use of BS and QR codes we must also introduce an error 
correction measurement-free routine for the QR code, i.e. states of the form a|0,0,0) We build a majority voting 

gadget, which we dub the 5Vf-gate (Fig. (1(a))). In the QR code all gates involved in the M gate are transversal and thus we can 
use this circuit as an EC gadget for this code at any level of concatenation. Moreover, through the 9\[ gate we can also use 9vt as 



an encoded majority voting gadget, i.e. acting on a state of the form a 0^,0^,0^^ +b 1^' , l K £' , V£' J. By virtue of the fact 

that the Bacon-Shor code is, in essence, a composition of X and Z basis QR codes, we can use !M&d\[ as the building block for 
the BS EC gadget. 

Schematically the BS QEC routine works as follows (we refer the reader to Fig. (1(c)) for a detailed description). The boxed 
part of Fig. (1(a)) is a syndrome extraction stage, and turns the ancilla, initially in a state, into a string which contains the 
error information. We adapt this method to the BS code. In this code, to correct for X-errors, we execute an extraction stage 
in every column of the BS state and get three strings (s\,S2,sj,). We use them to vote into a fourth one .54 = s\ S2 © 53, which 
will control final correction via J\£ and bTOFFOLI. A single error in e.g. column one of the BS state leads to 54 = si which 
would correctly execute the correction by virtue of the gauge freedom; on the other hand a gauge operation, e.g. two X-errors in 
the same row, leads to 54 = s s = which correctly implies an identity correction operation. An analogous analysis holds for 
Z-error correction. 

Now the X and Z correction stages of the BS QEC routine are essentially equivalent but have some differences. 
Because the syndrome information after the syndrome extraction stage is different in both cases, we define 9^ x ^ 
(and i\£ (z '), gates for the BS code at level k of concatenation (see Fig.(l(b))): 9{f- x \k) = nie/wsO^^)/^ ~ 1) = 

ILCWO7( li0i( 3 i0 (*- l)UiCNOT {2M3ji) (k- 1)^-^5)^-1), *6 z \k) = Y\jeco, um ns(V*6 Z) )j(k- 1) =UtCNOT m{lA) (k- 

(z) 

^)Y[iCNOT^T,Y^2){k— l)Ylj^(^ ^(k— 1), where A ^ denotes gate A acting on the qubit in row r and column c of the logical 
qubit. The (X) or (Z) version of the gate is chosen depending on the correction subroutine in which it is being used, e.g. to 
correct X-errors (as in the lower part of Fig. ( 1 (c))), we use 9{j x \ The 9\(j x ^ (iY^ 2 ' ) is a Z(X) decoder, where one keeps only the 
convenient protection while completely unprotectng against the other type of errors. Moreover, after the X syndrome extraction 
stage, the corresponding ancilla does not need protection against Z error, so only the lower stage (ECx) of EC must be used. 
This greatly reduces the overall execution time for encoded gates acting on those ancillas. We found that due to this property, the 
subroutine (V9*£)i(k) not only takes less time, in terms of execution time of level-(A:— 1) gates, but it can be shown to fail with a 
probability smaller than a CNOT(k), for k > 1. For k — 1 EC gadgets, there is no need to use 3\£ since 9\£(0) = Id. We detail this 
in the Supplementary Material. A 

We are now ready to describe the remaining elements of our BS code fault-tolerant scheme. First we describe the elements 
needed to fault-tolerantly simulate any circuit based solely on Clifford operations. (I) Preparation o/|0l) and \+i)states: by (i) 
starting with a 3 x 3 array of |+), and (ii) applying a in every column we can prepare a |+z). Similarly |0l) is obtained 
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Measurement-free QEC routines for the QR and BS code. The inputs are pW^ = |000)® 3 ' 

3Vf gate. An X-encoded majority voting gadget of level-(A-+ 1) of concatenation. Here all CNOTs are bitwise, i.e. each CNOT depicted 
corresponds to three CNOT(k), and subscript R corresponds to a cyclic fc-encoded rotation of the targets of the corresponding gate. In the 

QR code the TOFFOLI gate depicted is bitwise. The M gate can also be designed for a Z-encoded quantum majority voting, with +^ y 

ancillas and the obvious Hadamard conjugation of gates. When the need to distinguish them arises we shall denote X and Z encoded majority 
votings 5Vf( x ) and 5Vf( z ) respectively, (b) A subroutine acting on ancilla for processing error syndrome information extracted from the data. 
The circuit shows one row, (V9Qi{k), of the fully contracted exRec V9\[(k) = \Jh[fk-\- 1)J representing a collection of fc-level protected gates 
acting on row i of ancilla which take part in an EC(k+ 1) step. Note that in this circuit the output top lines are discarded so no EC gadget must 
protect them. With this, the exRec corresponding to 2\£ at degree of concatenation k is d\[(k) = EC(k) x Xh^ raws (V9iC)i(k — 1) x EC(k). In 
our circuits \_G(k) J denotes the implementation of gate G, in terms of level- (k — 1) gates, without the prepended and appended EC(k) routines, 
and W denotes a waiting gate, (c) Full error correction (EC) gadget for the BS code. Here, a TOFFOLI with o controls is a Z— TOFFOLI; 



CX = n? J= i CNOTP] js , { i ., is a set of transversal CNOTs, CX£ 

control of the gates in boxes is always the top input of the gate. The last gate is a bTOFFOLI 



Ulj =l CNOTQ JUt . J±1) mdCX^ 



=iCNOT^. mUt .. y Thc 



by (i) starting with a 3 x 3 array of |0), and (ii) executing a i/Vf' z ' in every row. (II) Clifford group generators: CNOT ,H \Z l l 2 : 
The CNOT gate is transversal, the H gate can also be implemented in a bitwise fashion but, because stabilizers are rotated by 
this action, it is followed up by a physical 7t/2-rotation accommodated by relabeling or rewiring of gates. The Z 1 / 2 gate can 
be implemented using the circuit in Fig. (2(a)), provided one can prepare a logical ancilla in |±/l) = (|0l) ± j'|1z,))/v^- Since 
the Z 1 / 2 gate is not part of the EC routines, it is only needed at the highest level of concatenation. Furthermore, as it is the only 
complex gate, it can be shown that by always using the same logical ancilla prepared in \0r) = 1/ v / 2( | +i£) + 1 — h) ) t° activate the 
circuit in Fig. (2(a)), then the entire quantum computation splits into two noninterfering paths (evolution by U comp and U* omp ) and 
the measurements of real, Hermitian operators at the end have the same expectation values as for evolution by U comp alone [17]. 
Alternatively one can use the distillation circuit in [ ] at the highest level provided one can prepare it with an error rate below 
P(i-unc) — 1/2- (HI) X and Z basis measurements.- They are only required at the highest level of concatenation. Given their form, 
measuring encoded logical operators can be achieved measuring only one row or column of the 9 k x 9 k encoding array. 
Threshold calculation for Clifford operations.- We use the extended rectangle (exRec) method developed in [ ] to compute the 
threshold (see Supplementary Material for more details). An exRec of a gate is constructed by prepending and appending error 
correction routines on the inputs and outputs. The exRec with the largest number of malignant pairs, i.e. the number of pairs of 
faults which generate two or more errors in the data, will determine the threshold value. A quick inspection reveals that the largest 
exRec is the one corresponding to the CNOT gate. Following [ ], only at level k = 1 must one consider all elements: preparation 
and gates (including waiting gates). At level k > 1, using contraction of exRecs, preparation locations can be omitted. This means 
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that one has to solve the recursion relationships for the error at level j: 

P W <A{ k=1) (p®) 2 ; p« <A\ k>l) {p^)\ for*>l, (2) 



where A'^ = ^ip ^1 + ^/l + jj^y; ) > B denotes all possible three-site errors, and denotes the number of malignant pairs in 

the largest exRec of that level. This process can be repeated for four site errors, etc. to get an even tighter bound [ ]. Executing 
this algorithm with our largest exRec, the CNOT, we obtain a threshold value, for preparation and gates, pUgjthresh = 3-76 x 10~ 5 . 
This value is not a bound for measurement error rates since they are not needed during the QEC process and are only required at 
the highest level of concatenation. So it follows that 

Pt) 1 ' * 3(pg) 2 + 0(p«). (3) 

If preparation and gate error rates are below threshold, then for k large enough p( k > is vanishingly small and the terms 0(pW) 
can be neglected. Then the threshold condition foiX and Z measurements is P( m ) t hresh = 1/3. 

Encoded non-Clifford operations.- The missing component to achieve universality is the FT execution of a non-Clifford gate. 
Using the circuit in Fig. (2(b)) we translate the problem into preparing the \H£) ancilla. To create an ancilla at the highest 
level we will use an encoder circuit which will allow us to keep the P( m )thresh < 1/3- To encode an arbitrary state we use 
the following algorithm: (i) we start with the level-0 state |(|)) we want to encode and 8 |0) states, then (ii) we use CNOT 
gates, including waiting times such that never in one step does one qubit interact with more than one qubit, to create the state 

1 ) . Finally (iii) we execute a fM^ gate in every row, to create the state = a \0l) +b 1 1/,). We can 

/ 3x3 



♦ = a 

3x3 
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recursively use the same algorithm to create the state at any level of concatenation k. Repeating this process recursively yields 

[one) 



an error rate for the encoding at the highest level of concatenation k — L, p^c < 10/?® + 108J^Lq ■ Clearly p^ nc -\ cannot be 
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FIG. 2: These circuits need only be implemented at the highest level of concatenation, and thus all operations depicted are encoded operations, 
(a) Circuit used to execute an encoded Z 1 / 2 Clifford gate on an arbitrary input \tyz.) [17] . (b) Circuit implementing the non-Clifford operation 
Z 1 ' 4 " given the encoded resource magic state \Hi). 

made arbitrarily small, however, provided pty < pthresh> it can be made small enough to give p^ m ) — sm2n /8, and then one can 
use MSD to achieve FTUQC [8] . 

Additionally, we promised that preparation errors can in fact be much higher than gate error rates. The argument proceeds by 
using a variant of the algorithmic cooling algorithm introduced in Ref. [18]. For a group of three qubits (a,b,c) with identical 
probabilities p^ = e'°) < 1/2, to be in the erroneous state |1), we apply TOFFOLI^ c j,y a ^CNOT^ ac )CNOT^j,). The reduced 
state of qubit a is colder, i.e. has lower error (eW < e^). Concatenating the process, after j rounds using a total of 3 ; qubits, 
the final error of the one output qubit satisfies the recursion relation = (ew -1 ') 2 (3 — 2e^' -1 )). Including gate errors, the total 
error of this preparation process is p9\ < + ^(V — ^)p^l- 

We are now ready to combine our tools. If we are sensibly below threshold, say with p(g } p) — 0.75/7^ p\thresh — 

2.82 x l(r s , 

then with p^ = 33% we get p^l ~ 10~ 13 and p^Jc = 8.32 x 10~ 3 which is safely below the 14.6% needed for \H£) distillation 
(and certainly below the 50% needed for the \+ii) distillation [3]). Thus FTUQC is achievable with noisy and currently 
achievable measurement error rates, but with only a small impact to the threshold value as compared to the best known result 
(1.26 x 1(T 4 ) for the same code allowing measurements [ ]. One can go further and use algorithmic cooling to also push 
preparation error rates within reach of current technology. We find that if one has physical preparation error rates of pu,) = 1%, 
then two rounds of AC and physical gate error rates pt g \ = 2.32 x 10~ 6 allow for FTUQC. Preparation rates as high as 1/3 can 
also be allowed, at the cost of demanding a lower gate error rate. For pi v \ > 1/3, one can instead use noisy measurement since 
measurement followed by a unitary is preparation. 



5 



To put this result in perspective, notice that pu\ = 1 .39 x 10~ 6 is not a threshold value but the required value such that effective 
preparation and gate error rates are sensibly below our threshold (0.75 x pthresh)- In comparison, under the same assumptions 
the best known result [ ] implies that quantum computing is possible, with reasonable overhead, when prp^,m) ~9.5 x 10~ 5 . 
So the price we pay to push measurement and error rates within reach of current technology (an improvement of three and two 
orders of magnitude respectively), is demanding roughly two orders of magnitude more stringent gate error rates. The result is 
even more significant if one considers recent results which show that arbitrarily accurate unitary gates (and not measurement 
and preparation) can, in principle, be achieved via open system control strategies [10]. Furthermore note that the required 
measurement and preparation error rates have already been reported: in trapped ions [4], pi m \ — 2.3 x 10~ 3 while in quantum 
dots [5], p {m] = 3 x 10~ 2 . 

We point out that the threshold value for gates computed here is by no means tight as we wanted to keep calculations simple. 
We have overcounted malignant pairs of locations, and certainly the design of our circuits may not be the optimal one in terms of 
error locations, thus in principle the threshold can be improved. On the other hand, restricting ourselves to two-qubit interactions 
only, and decomposing TOFFOLI gates into one and two qubit gates degrades the gate and preparation threshold value to 
2.69 x 10~ 5 . Also restricting to nearest-neighbor only interactions will degrade the threshold value [19]. In our circuits ancillas 
can be prepared offline and we have been careful to limit measurement only to when the data is encoded (at the highest level of 
concatenation), thus physical systems with slow measurement or preparation are allowed. 

In conclusion, we have shown that measurement-free QEC is viable, considerably relaxing the time and error rate constraints 
on preparation and measurement operations, and pushing them within reach of current technology, while yielding only a small 
penalty to the gate threshold. This small penalty seems even less relevant if one considers recent results showing that arbitrarily 
accurate unitary gates can, in principle, be achieved using open system control [10]. Those results complement the methods 
developed here and bring fault-tolerant quantum computing closer to reality. 
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Appendix A: Details of the gadgets and the threshold calculation 

Central to the construction of our error correction routine is the 9\[(k) gate, which acts as a map between the Bacon Shor (BS) 
and the Quantum Repetition (QR) code of the same level of concatenation k. It is not evident from the start that 9£(k) takes 
the same amount of time as a gate from our gate library of the same level, nor is it obvious that the failure probability of the 
corresponding exRec is smaller than that of a CNOT exRec. We show here that indeed this is the case and use these attributes to 
compute the error threshold. 



1. The 9i gate 

a. Execution time of0\[ 

One of the main properties we use in our threshold calculation and in our circuits is that 9\[(k) takes less time than a fully 
protected gate of the same degree of concatenation k, i.e. T(fJ\[(k)) < T(G(k)), where T(A(k)) denotes the execution time of the 
protected gate A. By protected we mean the gate has EC gadgets prepended and appended to the gate. The key observation to 
prove this is that whenever 9^ is used, the state only needs protection against one kind of error, for example during the X error 
correction stage the ancilla only needs protection against X-errors. So to achieve this protection we execute the EC gadget, ECx, 
without the Z error correction stage. The same analysis follows for the ECz- In this section we will prove relations explicitly for 
9{j x \k), and thus will omit the X or Z superscript, but the reader should have in mind that the same results hold for 9{j z \ 

With this in mind, we begin by comparing a fully protected exRec of a gate G at level-£ of concatenation with 9\[(k). The 
relevant gates can be decomposed as 

G(k) = EC(k)o[G(k-l)\oEC(k) 
9{(k) = EC x (k)o[*C(k-l)\oEC x (k) 

where the notation \A(k— 1)J denotes the implementation of the A(k) gate in terms of (k — l)-level protected gates but omitting 
&-level protection. At level k= 1, \A(k — 1)J corresponds to the physical gates implementing the encoded gate. Moreover a 
contracted extended rectangle (exRec) A(k) is composed by the implementation of the gate in terms of level- (k — 1) gates and 
level-£ error correcting gadgets in all the inputs and relevant outputs. We use the notation G(k) to denote a fully protected gate 
made up of a single step of level k — 1 gates, for example a transversal CNOT(k — 1 ) gate. In contrast the 9\[(k) consists of more 
than one step of level-(£— 1) protected gates. To calculate a bound on the time needed perform our error correction, first notice 
that the full error correction gadget as illustrated in Fig. (1(c)) consists of X and Z error correction which both consist of the 
same number of gates and overlap in all but two locations (neither of which is an C\[(k) gate). Hence, regardless of the structure 
of 9\[(k) or it's time duration we find 

T{EC{k)) = T(EC x (k))+2T{G(k- 1)). (Al) 

Moreover, given the structure of 9\[(k) (see Fig. (1(b))), it follows then that 

T(9i(k)) = 2T(ECx{k))+2T(G(k~l)) + T(9£(k-2)) 
T(G(k)) = 2T(EC(k)) + T(G{k-l)) 

= 2T(ECx{k))+5T(G(k-l)), (A2) 



which in turn implies that 



T(G(k)) - T(Zt(k)) = 3T(G(k- 1)) - T{9t{k- 1)). (A3) 



At level k — 1 we do know the form and time-duration for all gates in the circuit, and we have that T(G(l)) — T(9\[(l)) — 
3T(G(0)) - T{9£(0)) = 3J(G(0)) > 0. So an inductive reasoning leads us to 

T(G{k)) - T{*C{k)) > 2T(G(k- 1)) > 0. (A4) 



Even more, in the error correction gadgets of level-fc we used the subroutine V9{JJi— 1) which is composed of level k—1 gates 
acting on 9 level k — 1 BS encoded inputs and 3 level k — 1 QR encoded outputs. For analysis of this module, the sequence of 
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operations can be decomposed as follows 

V$C(k-l) = ECx{k-l)o[G{k-l)\oEC x {k-l)o[G{k-l)\o9i{k-l) 

= EC x (k-l)o[G(k- 1)J oEC x (k- l)o [G(k- 1)J o£C x (^-l)oG(^2) 

oG(Ar - 2) o - 2) o iW(jfc - 1) 
= £C X (Jt - 1 ) o [G(k - 1)J o [G(/t - 1)J o G(Jt - 2) o G(Jt - 2) o fA£(jfc - 2) o 5Vf (jfc - 1 ) 

where in the last equality we have used the exRec-contraction technique from Ref. [3], and used the fact that only the lower 
output of the gate will be used. The execution time of this contracted exRec satisfies 

T{V9i(k-l)) < 2T(EC x (k-l))+4T(G(k-2)) + T{9i{k-2)) 

< 2T(EC x (k-l))+5T(G(k-2)) 

= 2T(EC(k-l)) + T(G(k-2)) 

= T(G(k-l), (A5) 

where we have used the fact that T(ECx) > T(9vt) to go from line one to two, Eq. (A4) from two to three and Eq. (Al) to get 
the last equality. This shows why V9^(k— 1) takes one level k—l time slot in our circuits. 



b. Error contribution of 9\[ 

Another relevant property for our threshold calculation is the failure probability of a 5\£ gate at some level k. For instance, 
if its failure probability was greater than that of a CNOT of the same level, we would have to equate that into our threshold 
calculation. We want to show something even stronger: during EC(k) one uses the subroutine (V9*Q(k— 1) (see Fig. (1(b))) 
which can be further decomposed as (V$£)(k) — Yheraws(y9£)i{k)< we w iU show that the error probability of the contracted 
exRec corresponding to the collection of level-(£— 1) gates (V9£)i(k— 1) is not larger that of a CNOT of level-(£— 1). In a more 
fundamental way we will show that the exRec with the highest failure probability is the one corresponding to the CNOT gate. As 
in the previous section, we will prove all relations for the 9{j x \ To simplify notation we drop the X super or sub scripts when 
necessary but remind the reader that the analysis holds for both X and Z related routines. 

The sketch of the calculation is the following. At an arbitrary level of concatenation k, the failure probability of an exRec will 
depend on the failure probability of gates and of 3\£ of lower level- (£ — 1), but since we do not know ab initio what is the failure 
probability of (VS\Qi(k), we only know its failure probability must be larger than the 9{Jk) one, we cannot directly compute a 
threshold condition. Fortunately we know what is the specific form of (FfA£);(l) at level k = 1 in terms of level-0 gates, and 
we can directly compare it with CNOT(l). In general, this comparison can be done first counting the number of malignant pairs 
of level-(& — 1) errors within a gate G exRec of level-fc, A(G(k)) and written in terms of various malignant-error parameters 
{Aecym,...} which we will define below. The error probability of such exRec is then given by 

pf=* G(k) {pV? (A6) 

where G corresponds to the gate of level k—l with the highest failure probability, A' g q^ — ^1 + ^1 + p-^ji^ > an d ^G(k) 

denotes all possible three-site errors in the exRec. Once we show that at level k—l, the CNOT is the largest exRec, then we 
can do the analogue level k = 2 calculation but now replacing the failure probability of (VfA£)/(fc) with now with parameters 

{A Ec(k=2) )■■*)}• An inductive reasoning will finally lead us to the comparison at any level of concatenation: p^ N0T > p ^ ■ 
The malignant error parameters are defined as: 

• Aec (Aec x ) tne number of malignant pairs in an EC (ECx)', 

• u (ux) the number of single failures in an EC (ECx) which generate a single error in the data; u (iix is similar but restricting 
the error to be in only 8 out of the 9 qubits in the encoded data. This case is important when we have an errors propagating 
through a CNOT; 



• a (ax) the number of single failures in an EC (ECx) which, in conjunction with an incoming data error, generate a double 
error in the data; 
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• Parameters A^, m, m and p can be defined for the quantum repetition code and its error correcting gadget, the 9vt gate. 

because at this point we have to assume that the size of the circuits varies with every level of concatenation, then each parameter 
will have a (k) denoting the level of concatenation it corresponds to. 

Let us now proceed with the calculation. The number of malignant pairs in the CNOT(l), (VJ\0,-(1), and exRecs are 

then given by 

A CNO t(\) < (4A BC (i) + 16k (1 ) +«(i)«(i) +4M0C( 1 ) + 18a (1 )+ 36) 
A {V9C)i(\) ^ (3 a ec x (i) +3«x(i)«x(i) +66m X (i) +3m X (i)P(i) +33P (1 ) +363), 

A bTOFF(\) = (2A £C(1 )+2A M(1) )+m (1) m (1) +2« (1) m (1) +M( 1 )a + 2m (1) a + 8M (1 ) + 16w (1) +9a (1) +36 

while a direct count gives the value for the malignant-error parameters {A EC n\ — 4182, Ur\\ — 63, fim = 56, am = 42,A ECx ^ = 
2031,m x(1) =45,fi x{ i) = 30,a x[l) = 20,A M(1) = 177,m (1) = 12,m (1) = 8,p (1) = 5,}. This yields p^ = (11836)Cp(°)) 2 < 

\ (33036) (p(°>) 2 = \pclor and PbTOFF = ( 14784) (p(°)) 2 < ± (33036) (p™) 2 = \pc ] N0T - Now, for/t = 2, we obtain the following 
failure probabilities, using that bTOFFOLI(l) and (V^Qi(l) fail with half the probability of a CNOT(l) gate. 



A CNOT(k) = (4A EC[k) + 16u [k) + u {k) u {k) +Aua [k) + 18a w + 36) 

A {V*Qi{k) = (3 A EC x (k) + A M(k)+3ux( k )Ux( k )+72u X ( k )+3u X ( k - ) $( k )+36$( k ) +432), 

A bTOFF(k) = ( 2A EC(k) + 2A M(k))+ m (k)m(k)+ 2u (k)m(k)+ u (k) a -(k) + 2m (k)U(k) 

+8«( fe ) + I6m {k) + 9a (jt) + 36 (A7) 

with corresponding parameter values {A EC{k>1) = 1953H (lt>1 ) = 63,5^!) = 56, a (t>1 ) = 33, A EC x {k>\) = H28, u x{k>l) = 
45, S x(i:>1) = 30, a x{k>l) = 16, A M(fc>1) = 105, w (jfc>1) = 12, m (lt>1) = 8, P (fc>1) = 4, }. A direct calculation shows again 

^(V3\0 \Pcnot and PbTOFF < iPcnot- F rom tms point on, the structure of the level k error correction circuits, and thus the 
corresponding malignant error parameter values, are the same of the level k = 2 circuits, so repeating the process for k = 3, 4, k 
leads us to the conclusion that the CNOT exRec is in fact the largest exREC to be considered and the one which will determine 
our threshold value. 



EC 



-{ec] 6 [ec]- 

(a)CNOT(k) exREC 



EC] [EC] (p [EC 



■ECx(k) —{W}- 



■EC x (k) 



M 4 



(b)bTOFFOLI(k) exRec 











^(^(jfc-i) 




M ix \k-\) 



(c)V9d(k) contracted exREC 



FIG. 3: The largest exRecs to be considered. An EC gate corresponds to a BS QEC routine while a M gate corresponds to a QR QEC routine. 
The circuit (3(c)) is executed in every row of the 3x3 array. Because in the exRec we are discarding the top-lines we do not require 
output M gadgets appended to them. Moreover, at level k = 1 there is no need for the waiting (W) gate and both CNOTs can be executed 
simultaneously. 



2. Error analysis for the encoder circuit 

The error analysis for the encoder circuit is as follows: to encode a level k state provided a level k— 1 state, we have that step 
(i) uses 8 CNOTS, 20 waiting gates, and 8 |0) preparations failing with probability and step (ii) can introduce unwanted 

phases with a single error (note that this is not a problem in Clifford ancilla preparations e.g. |0) states.) thus we count all 
locations in the M gates. A contributes with 27 level-0 locations, while a M(k), for k > 1, contribute with 24 level-(k-l) 

locations. So we have 

pE<10// 0) + 108 J>W (A8) 



which justifies our encoder circuit error analysis. 
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3. Threshold calculation 

To find the threshold we must now solve the system of equations 

PCNOT - A CNOT(k=\)\P ) ' 
PCNOT — A CNOr(k>\)\PcNOT) i tor «> 1 i 

which in turn gives that, P( Ptg )thresh — r=, 1 , = 3.76 x 10~ 5 . Because no measurement exRec of lower levels had 

\/ A CNOT (k= 1 ) A CNOT(k> 1 ) 

to be used in our calculations, this threshold only applies to gate and preparation. To find what is the Clifford measurement error 
rate threshold , we note that at the highest level of concatenation 



(A9) 
(A10) 



< A EC(k){P 



(All) 



So if preparation and gate error rates are below threshold, then for k large enough pW is vanishingly small and the first two terms 
can be neglected. Then the threshold condition for X and Z measurements is pt m )thresh = 1/3' 



a. Threshold calculation with two-qubit interactions only 

In our library of gates we assumed the possibility of executing 3-qubit gates in the form of a TOFFOLI gate. Threshold 
computations using this gate are standard, see e.g.[l]. Although for some architectures it is reasonable to include three qubit 
gates in the gate library [20], but for others two qubit interactions are mor e natural. Here we show how to compute the threshold 
when decomposing TOFFOLI gates into one and two qubit gates. The main observation is that this decomposition only affects 
the level- 1 of the concatenation level, where we actually replace the TOFFOLI as shown in Fig. (4) Lets recall the level- 1 
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FIG. 4: Decomposition of a physical TOFFOLI gate into two qubit interactions. Note that it requires only three time-steps as the first two 
CX 1 / 2 s can be executed simultaneously. The last CNOT gate is not necessary as we will typically discard the controls of such TOFFOLI and 
thus they do not count towards our threshold estimation. 

bTOFFOLI for the BS code is composed by 3 physical TOFFOLI targeting one row, and controlled by three classical three bit 
strings, and 6 waiting gates. Since a physical TOFFOLI can be executed in three time steps and is composed of five gates, then 
the encoded bTOFFOLI gate is composed of 3x6 + 3x5 = 33 physical gates. Note that this decomposition mainly affects 
level- 1 of concatenation, as we are only decomposing the physical gate and not emulating such decomposition at all levels of 
concatenation. If we input this in our analysis, an analogue hierarchy of exRecs is maintained i.e. PbTOFF(k) < \PcNOT(k) an d 
P(v$Qj(k) — \PCNOT(k)i which in turn yields a threshold of ph, ^thresh = 2-68 x 1CT 5 now limited to two-qubit gates only. 



Appendix B: Tools for other CSS codes 



At a first glance the tools developed here are specialized for the 3 x 3 BS code, however we want to show that they can also be 
adapted to other CSS codes. We present here some tools which will be necessary to generalize our methods. 
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1. M gate for larger codes 



If we were to use larger codes, e.g. the 25 qubit BS code, one would typically need to execute larger majority votings, i.e. of a 
longer distance. To achieve this purpose we developed a way of executing majority voting in a fault-tolerant way or, equivalently, 
a way of fault-tolerantly and unitarily correcting a quantum repetition code. We denote this gate as 9vt{N). 

M{N) : \si,s 2 ,...,s N ) N ®\£o) -> \m,...,m) N <2)\E{ s .y) = \m) , (Bl) 

where m = MBF{s\,S2, ...,£#}> l £ o) = ) and MBF is the majority boolean function. Note that when N — 2k i.e. is even, the 
MBF may not be solvable, i.e. when the string is balanced, in those cases the protocol will just take {si,S2, ■■■,S2k} to another 
balanced { s\ , s' 2 , ■ ■ ■ , s' 2k } ■ 

Consider the 5Vf gate in Fig. 5. The C gate can now be chosen to be a series of multi-controlled NOT gates targeting the data 
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FIG. 5: The boxed part of the circuit is in charge of the syndrome extraction, R corresponds to a cyclic permutation of the physical qubits 
and \s) denotes a codeword [sj ,...,,$#}■ The last unitary C, targeting the top line, will be the one in charge of executing the desired operation, 
depending of how we choose the controls as we will see below. All operations depicted here are bitwise and thus transversal. 

string. In general, it will be all possible fe-controlled-NOT gates targeting the data, where k is to be chosen from a set Kn which 
is characteristic for every string length N, e.g. ^3 = {2}, ^5 = {4, 3}, Kj = {6,4}. For the purposes of this paper the N — 3 case 
is of special interest and C is just a TOFFOLI gate which is assumed in our library of physical operations. For notation purposes 
we write M for M (3). Note that with this majority voting gadget we can build the corresponding EC gadget for larger BS codes, 
in the same way we used the 5Vf to build the 3 x 3 BS code QEC routine. 



2. Parity voter (!P) 

Other circuit which we do not use here, but may be of use is the Parity voter circuit, fP: 

T(N) : \si,s 2 ,...,s N ) N ®\zo) -> |9,-,?)jv® | £ fe}) = \$), ( B2 ) 

where q — s\ ©S2 © ... ©sjv- This gate can be executed slightly modifying the circuit in Fig. (5): we (i) swap the order of the C 
and the last set of CNOTs and, (ii) define C as all the possible CNOTs controlled by the ancilla qubits in a column and targeting 
the data string qubit corresponding to that column. Let us note that we can also vote the parity of a string \q\,q2, ■■■,qs) into it's 
last qubit via flLi CNOT iik . 



3. \Cat) state verification 



In contrast to the 9-qubit Bacon-Shor code other codes need an extra element: \cat) state verification. This verification stage 
in general provides a test which if passed gives an outpput state which when used an ancilla in the QEC process will not ruin the 
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fault-tolerance, and if failed indicates that the whole ancilla preparation & verification process must be restarted. The method we 
develop here will not be a "test" but rather a deterministic way of producing "verified" output |caf)-states. 
Depending on the EC gadget of choice one will typically need a way of verifying the state 

\cat) = (1/V2)(|0000) + |1111)). (B3) 

Typically these states are prepared and verified through a measurement: if the verifier qubit has not flipped then the state passes 
the test, if it has flipped then the preparation & verification process must be restarted. The whole idea is that a two bit-flip 
cat-state such as (|0011) + |1100))/\/2 has to pass the verification with probability p 2 or worse, such that the fault-tolerance is 
maintained. 
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FIG. 6: Shor's measurement aided \cat) preparation and verification. Two errors, a lethal scenario for quantum error correction, show up in the 
final state only when two or more operations are faulty. 

Note that this state is the result of a single failure of a CNOT during the preparation stage, however the measurement gives us 
a criterion for discarding it. An extra failure in the measurement must happen for the bad cat state to pass the test, but that is 
already a p 2 event so the analysis for fault-tolerance is valid. This process is non-deterministic in the sense that one error in the 
measurement can lead to the rejection of a perfectly good ancilla. As we want to avoid measurements we would ideally want 
to avoid such process altogether, thus we can execute the circuit in Fig. 7. This implies that we have a way of unitarily and 
deterministically, i.e. no discarding and restarting the process, preparing our verified \cat) state ancilla. 



4. M &.9t for other CSS codes 

Beyond the circuits that we presented here, in a deeper sense what our 9{j x ^ (9{j- z ^) circuit does is check the parity of all 
representations of logical X (Z) operators modulo stabilizer operators, and project this information into a X (Z) QR code. For 
example, the Steane code is defined by the stabilizers {IIIXXXX ,IXXIXX ,XIXIXIX ,IIIZZZZ,IZZIZZ,ZIZIZIZ}, so there 
are seven different implementations of Xl (Zi) using only three X (Z) physical gates. So to execute the 9(} X ' gate we execute 
the following protocol for all the seven logical operators. For the operator Xz r obtained by applying X operators on the qubits 
a,-, p,,Y, of the codeword we: (i) prepare an ancilla |000) state; (ii) a bitwise CNOT between the qubits a,-, p,, y,- of the codeword 
and the ancilla, the resulting state of the ancilla is a string \sj); (iii) vote the parity of \s,) into its last qubit, \p(sj)). Note that the 
process can be executed simultaneously, because no qubit is targeted twice and we are not concerned by propagation of Z-errors 
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FIG. 7: We replace the verification measurement by two successive actions of M, the circuit will output a bad cat if there are two failures 
within the circuit, i.e. a p 2 , or higher order event in p, as desired. The same method can be executed for larger cats still admitting only one 
error, using longer M cascades. For cat states admitting more errors we just use cascades of larger versions of M. 
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into the data. Finally we execute an 9Aq gate on the string to obtain the final state. So we have achieved the mapping 

7 

#:|*L>->fW(®bfa))) = |je) (B4) 
1=1 

This process is fault-tolerant and needs two errors for the gate to fail. To see this, note that one error in the data, or in (i) to (iii) 
may at most generate a string with three errors which can still be corrected by 5Vf , e.g. |0l) 1 error^ 1 1 1 10000) 0^. An extra 

error during the simultaneous (i)-(iii) or in the 5Vf will generate the wrong execution of the 

This completes the basic tools needed to adapt our scheme to other CSS codes. Since unitary QEC circuits already exist in 
the literature for other CSS codes [1, 12] using TOFFOLI gates, one now has to simply replace the TOFFOLI by bTOFFOLI to 
execute them fault-tolerantly at any level of concatenation. Note that the results regarding the timing on a 9{,(k) gate still hold for 
this case since (ii)+(iii) take only 2 x T(G(k— 1)), it is also true that T(EC(k)) = T(ECx(k)) + dT(G(k — 1)) for some integer d, 
which will allow the same analysis as for the BS code. The form of 9\[ can be modified for a particular QEC circuit in order to 
optimize the error count. 



